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Abstract: R-loops are cellular structures composed of an 
RNA/DNA hybrid, which is formed when the RNA 
hybridises to a complementary DNA strand and a 
displaced single-stranded DNA. R-loops have been 
detected in various organisms from bacteria to mammals 
and play crucial roles in regulating gene expression, DNA 
and histone modifications, immunoglobulin class switch 
recombination, DNA replication, and genome stability. 
Recent evidence suggests that R-loops are also involved in 
molecular mechanisms of neurological diseases and 
cancer. In addition, mutations in factors implicated in R- 
loop biology, such as RNase H and SETX (senataxin), lead 
to devastating human neurodegenerative disorders, 
highlighting the importance of correctly regulating the 
level of R-loops in human cells. In this review we 
summarise current advances in this field, with a particular 
focus on diseases associated with dysregulation of R-loop 
structures. We also discuss potential therapeutic ap- 
proaches for such diseases and highlight future research 
directions. 



Introduction 

R-loops are three-stranded structures, which form when RNA 
hybridises to a complementary DNA strand, forming an RNA/ 
DNA hybrid, resulting in displacement of the other DNA strand in 
this process (Figure 1). The first R-loops were described in 1976, 
when their formation in vitro in the presence of 70% formamide 
was visualised by electron microscopy (Figure 1) [1]. These 
structures were thermodynamically more stable than duplex 
DNA, and they remained intact following removal of formamide. 
This technique of RNA/DNA hybridisation has been used in over 
1 40 studies to map gene organisation, transcription initiation sites, 
and the direction of transcription, as well as measure the quantities 
of cellular RNAs [2]. 

The first evidence for R-loop formation in live bacteria was 
obtained in 1994 [3]. This was followed by numerous studies 
showing that R-loops exist in different organisms (Figure 1) [4—6]. 
In living cells, R-loops are thought to form in cis during 
transcription, when nascent RNA hybridises to the DNA template 
behind the elongating RNA polymerase (Pol) [4]. However, in 
contrast to this popular view of cotranscriptional R-loops, recent 
studies suggest that RNA transcribed at one locus can hybridise to 
homologous DNA at another locus, thus leading to R-loop 
formation in trans [7] . In the last five years, the use of an antibody 
(S9.6) recognising RNA/DNA hybrids has revolutionised the R- 
loop field [8]. Initially, the S9.6 antibody, which detects hybrids as 
small as six bp with an affinity of 0.6 nM, was developed as a tool 
to enhance the DNA/RNA hybridisation signal in DNA 
microarray studies [9,10]. More recently, it has been used to 
detect R-loops in vivo and uncover their contribution to 
fundamental biological processes in yeast [11,12], plants [13], 
mice [14,15], and humans [16—18]. 

The picture emerging from these studies suggests that R-loops 
can be both beneficial and deleterious to cells. Paradoxically, while 



they are required for important biological processes, they can also 
promote DNA damage and genome instability. In particular, R- 
loops have been shown to play an essential positive function in 
Escherichia coli plasmid and human mitochondrial DNA replica- 
tion [19,20] and during immunoglobulin class switch recombina- 
tion, which contributes to the antibody isotype diversity in 
activated B cells [21]. R-loops form on many genes in yeast and 
human cells [18,22] and have been implicated in regulation of 
gene expression. R-loops can repress transcription and promote 
transcriptional termination [16,23,24]. Furthermore, R-loops are 
clearly associated with epigenetic mechanisms governing tran- 
scription, including DNA methylation and posttranslational 
histone modifications [18,25-27]. In spite of this growing list of 
beneficial R-loop functions, it is also evident that R-loops can be a 
dangerous source of DNA damage. They can sensitize DNA to 
damaging agents [28], induce transcription-associated recombi- 
nation [24], double-strand breaks (DSBs) [29,30], chromosome 
breaks, and fragile site instability [31-33], and cause chromosome 
loss [34] . Therefore, cells need to tighdy regulate the levels of R- 
loops to exploit their unique features. Altering the physiological R- 
loop balance can impair R-loop-regulated processes, cause 
genome instability, and may lead to human diseases. Consequent- 
ly, defining the roles of R-loops in the multitude of biological 
processes and human disease is likely to develop into one of the 
most important and influential areas of R-loop research in the 
future. 

Proteins in R-loop Biology 

The number of proteins associated with R-loop biology has 
increased in the last few years, reflecting the diversity of R-loop 
processes (Table SI) [4-6]. Many proteins can regulate cellular R- 
loop levels either directly or indirectly, mostiy by preventing RNA 
from hybridising to DNA, thus reducing excessive R-loop 
accumulation. Among these are proteins required for efficient 
transcriptional elongation, termination, polyadenylation, RNA 
splicing, packaging, and export [16,24,28,30,31,34,35]. DNA 
topology itself can influence hybridisation of RNA to DNA, and 
topoisomerases consequently play important roles in modulating 
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Figure 1. History of R-loop research. The diagram depicts major developments in the R-loop field and diseases associated with R-loop 
dysregulation. 
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R-loop levels [27,33]. Proteins involved in maintenance of genome 
integrity can also regulate R-loops, suggesting a dynamic interplay 
between DNA repair and R-loop formation [7] . Importantly, cells 
possess dedicated enzymes, including the members of the RNase 
H family that specifically degrade the RNA in R-loops [36], and 
helicases that can unwind RNA/DNA hybrids [12,16]. 

Recent evidence shows that R-loops can directiy affect many 
gene expression-associated processes, including DNA methyla- 
tion, posttranslational histone modifications, and transcription, by 
influencing the function of regulatory proteins [16,18,25,26]. 
Despite the growing number of proteins involved in R-loop 
homeostasis and human disease, many questions still remain 
unanswered. For many proteins with documented in vitro RNA/ 
DNA helicase activity (e.g., Pifl, the MCM complex), in vivo 
evidence is generally still lacking (Table S 1) [37,38] . Moreover, the 
molecular mechanisms underlying interactions between proteins 
and R-loops are poorly understood, and in many cases the 
connections to disease remain obscure. 

R-loops and Neurological Diseases 

The biological importance of R-loops in humans is supported 
by the fact that mutations in proteins implicated in R-loop 
resolution cause devastating human diseases, often related to 
neurodegeneration. Mutations in the putative RNA/DNA helicase 
SETX cause neurodegenerative diseases, the dominant juvenile 
form of amyotrophic lateral sclerosis type 4 (ALS4), and a recessive 
form of ataxia oculomotor apraxia type 2 (AOA2) (Figure 2A). 
These diseases are characterised by progressive degeneration of 
motor neurons in the brain and spinal cord, muscle weakness and 
atrophy [39-41]. 

In addition to its predicted function as an RNA/DNA helicase, 
SETX interacts with proteins involved in diverse aspects of RNA 
metabolism [42]. Moreover, a single amino acid mutation, which 
compromises the function of the yeast homologue Sen 1 , dramat- 
ically changes the Pol II distribution genome-wide, further 
supporting the view that SETX/Senl functions in the regulation 
of transcription [43] . Recently, we demonstrated that SETX is 
implicated in transcriptional termination by Pol II in humans [16]. 
It is required to resolve R-loops at termination elements, releasing 
RNA for degradation by the 5 '-3' "torpedo" exonuclease Xrn2 
prior to termination (Figure 2 A) [16,44]. Mutations in the yeast 
homologue, Senl, also lead to a transcriptional termination defect, 
associated with accumulation of R-loops and genome instability 
[12]. In line with its function in R-loop resolution, SETX/Senl is 
also involved in maintaining genome integrity by coordinating 
transcription, DNA replication, and the DNA damage response 



[45-47]. SETX can target the 3 '-5' RNA degradation complex, 
the exosome, to sites of transcription- induced DNA damage [48] . 
Furthermore, SETX protects genome integrity by coordinating 
meiotic recombination with transcription during spermatogenesis 
and gene silencing during meiotic sex chromosome inactivation [14]. 
In particular, Setx knock-out mice accumulated DNA double strand 
breaks and R-loops and failed to disassemble Rad5 1 filaments. This 
resulted in a failure to cross over, likely due to collision between R- 
loops with Holliday junctions [14]. These defects in Setx knock-out 
mice lead to male infertility, raising the question as to how this relates 
to fertility of male AOA2/ALS4 patients. 

Studies in neuronal cells have demonstrated a role for SETX in 
neuronal differentiation through fibroblast growth factor 8 (FGF8) 
signalling, providing one explanation for the effects of loss-of- 
function AOA2 mutations [49]. Surprisingly, overexpression of 
dominant mutant forms of SETX did not affect neuritogenesis, 
suggesting that a different function of SETX may be affected in 
ALS4 patients. However, the interplay between the function of 
SETX in R-loop resolution, genome maintenance, and neuronal 
differentiation is still unclear. In a recent study, Lavin and 
colleagues examined cells from mice with disrupted Atm, Tdpl, 
Setx, or Aptx genes, which cause ataxia telangiectasia (AT), 
spinocerebellar ataxia with axonal neuropathy 1 (SCAN1), AOA2, 
and ataxia oculomotor apraxia type 1 (AOA1) disorders, 
respectively [15]. These diseases are characterised by a defective 
response to DNA damage, suggesting that R-loops may be 
implicated in triggering genome instability. Indeed, R-loops were 
found to be enriched in proliferating cells (testes), but not in the 
brain tissues from Setx, Atm, Tdpl or Aptx knock-out mice [15]. 
The enrichment of R-loops in testes correlated with high levels of 
DNA damage and apoptosis. The lack of R-loops in brain tissue 
questions the association between R-loops and neurodegeneration. 
This result is surprising, because inducible R-loops have been 
previously detected in neuronal cells at the Snnrdl 16 locus, which 
is associated with the neurodevelopmental disorder Angelman 
syndrome, as discussed below [50]. Furthermore, R-loops were 
implicated in inducing DNA damage in nonproliferating cells and 
post-mitotic neurons and proposed to contribute to the neurode- 
generation seen in AT patients [29] . It is possible that R-loops are 
regulated by different mechanisms in proliferating cells and post- 
mitotic neurons, thereby leading to different R-loop kinetics and so 
preventing their detection in some model systems. In particular, R- 
loop accumulation may reflect collisions between transcription and 
replication machineries [32,51], events which do not occur in 
postmitotic neurons. It should be noted that the mouse models 
currently used may not fully recapitulate all aspects of human 
neurodegeneration. 
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Figure 2. R-loops and human diseases. The diagram depicts the role of R-loops in human diseases. Loss of wild type protein function is depicted 
by red crosses. A. Ataxia and motor neuron diseases. Mutations in human RNA/DNA helicase senataxin are associated with AOA2/ALS4 disorders and 
lead to R-loop accumulation and defects in transcriptional termination by Pol II [16], the maintenance of genome integrity [46], meiotic 
recombination during spermatogenesis, gene silencing during meiotic sex chromosome inactivation [14], and neuronal differentiation [49]. B. 
Aicardi-Goutieres syndrome (AGS). AGS is associated with mutations in all three subunits of RNase H2, ssDNA 3'-5' exonuclease TREX1 (DNASEIII), 
dsRNA-editing enzyme ADAR1 , and dNTP triphosphatase SAMHD1 ; these trigger accumulation of unprocessed nucleic acids, including genomic DNA 
with incorporated ribonucleotides, R-loops, and retroelement-derived nucleic acids, and result in the immune response characteristic of AGS [65], C. 
Trinucleotide expansion diseases. R-loops form over expanded repeats and result in decreased initiation and elongation of RNA Pol II and formation 
of repressive chromatin marks, which silence the host gene containing expanded repeats [75], D. Genome instability in cancer. Loss of proteins 
protecting against abnormal R-loop accumulation, such as FIP1L1, leads to genome instability, one hallmark of cancer [31]. Yellow stars denote 
double-stranded DNA breaks. E. AID-mediated mutagenesis and translocations in cancer. Single-stranded DNA in R-loops is a substrate for cytidine 
deamination by activation-induced cytidine deaminase, leading to mutagenesis as indicated by orange stars [21,88]. These mutations can cause DSB 
formation, leading to chromosomal translocations. The IgH/c-MYC translocation brings the strong IgH enhancers, shown as yellow box, close to 
c-MYC, leading to its overexpression in Burkitt's lymphoma [87]. Transcription of IgH/c-MYC starts from a previously inactive promoter downstream of 
the translocation break point. The IgH locus is depicted in blue, c-MYC gene is in grey. The translocation breakpoint is indicated by a dashed black 
line. F. Senescence. R-loops formed by the noncoding RNA TERRA accumulate at telomeres in cells deficient of Hpr1 and RNase H. In the absence of 
telomerase, these R-loops promote Rad52-dependent telomere elongation and delayed senescence. In the absence of telomerase and Rad52, R-loops 
promote telomere shortening and premature senescence [94]. 
doi:10.1371/journal.pgen.1004630.g002 
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RNase H and Aicardi-Goutieres Syndrome (AGS) 

In addition to their generation during transcription, RNA/ 
DNA hybrids can arise due to incorporation of ribonucleotides 
into DNA by DNA polymerases during replication. RNase H 
enzymes are endonucleases that cleave the RNA of RNA/DNA 
hybrids in a sequence-independent manner, thus maintaining 
genome stability by resolving R-loops that form during transcrip- 
tion and by removing misincorporated ribonucleotides from the 
DNA [36]. Eukaryotic cells have two types of these enzymes, 
RNase HI and RNase H2, which have different enzymatic and 
site-specific activities [52]. In particular, RNase HI requires a tract 
of at least four ribonucleotides to cleave the RNA/DNA hybrid, 
whereas RNase H2 can incise 5' to a single ribonucleotide 
incorporated within a DNA molecule [36,52]. Therefore, only 
RNase H2 can process single ribonucleotides in the DNA, but 
both enzymes are capable of eliminating RNA/DNA hybrids. 
Unlike in bacteria and unicellular eukaryotic organisms, where 
RNase H enzymes are dispensable for viability, both RNase H 
enzymes are essential in higher eukaryotes. RNase HI has been 
implicated in mitochondrial DNA (mtDNA) replication during 
mouse development, a process likely to be associated with 
processing of RNA primers during mtDNA replication [53]. 

RNase H2 is composed of three different subunits, the catalytic 
subunit 2A, and two other subunits, 2B and 2C, all of which are 
required for enzyme activity. RNase H2 has been implicated in 
recognition and removal of ribonucleotides incorporated into 
DNA and hydrolysis of Okazaki fragment RNA primers during 
DNA replication [36,54—57]. In addition, recent studies point 
towards a role of RNase H2 in R-loop resolution during 
transcription in vivo [11,58]. In particular, deletion of Saccharo- 
myces cerevisiae RNase H2 imposes transcriptional blocks and R- 
loop accumulation over rDNA regions in cells depleted of 
Topoisomerase I [11] and transcriptional down-regulation of 
genes with higher guanine-cytosine (GC) content at the promoter 
regions, which are likely to form stable R-loops [58]. 

In humans, mutations in any of the three subunits of RNase H2 
cause Aicardi-Goutieres syndrome (AGS), a neurological inflam- 
matory disorder, which resembles a congenital viral infection and 
is associated with accumulation of ribonucleotides in the DNA 
(Figure 2B) [59,60]. Interestingly, AGS can also be triggered by 
mutations in single-stranded DNA (ssDNA) 3 '-5' exonuclease 
TREXl(DNASEIII) [61], double-stranded RNA (dsRNA)-editing 
enzyme ADAR1 [62], and dNTP triphosphatase SAMHD 1 [63] . 
These proteins are involved in diverse pathways of nucleic acid 
metabolism, although their functions are not yet fully understood. 
They have been implicated in degrading ssDNA arising from 
endogenous retroelements or replication stress (TREX1), regulat- 
ing the intracellular dNTPs pool available for replication and 
reverse transcription of these retroelements (SAMHD 1), or 
altering the immune response to RNA species through RNA 
editing of retroelements and microRNAs (ADAR1) [64]. Muta- 
tions in these proteins are associated with an accumulation of 
unprocessed nucleic acids, which triggers the immune response 
characteristic of AGS [64,65]. 

So far, pathologies linked to AGS mutations in RNase H2 have 
been mainly attributed to genome instability caused by accumu- 
lation of ribonucleotides in DNA [56,66]. However, a specific 
contribution of R-loops and RNA/DNA hybrids to AGS 
pathology has not been yet investigated. This research has been 
hampered by the difficulty to uncouple the two activities of RNase 
H2; its ability to remove ribonucleotides from the DNA and to 
resolve R-loops, both of which are affected when RNase H2 is 
deleted [52,56]. Nevertheless, several lines of evidence suggest that 



R-loops may be involved in AGS pathology. Thus, an AGS- 
related mutation in the yeast RNase H2 enzyme resulted in its 
reduced RNA/DNA cleavage activity [52]. Since RNase H2 
constitutes —90% of the total cellular RNA/DNA hybrid cleavage 
activity, its loss due to AGS mutations may lead to significant 
accumulation of R-loops [56] . The importance of RNase H2 is 
further highlighted by the fact that mutations in RNase HI do not 
cause AGS, suggesting that RNase H2 may have unique properties 
to degrade RNA/DNA hybrids [52]. Indeed, R-loops arising 
during DNA replication may be exclusively degraded by RNase 
H2, as they may be inaccessible to RNase HI [52,67]. A recendy 
generated S. cerevisiae RNase H2 mutant, which possesses R-loop 
degrading activity but fails to remove single ribonucleotides from 
the DNA [52], will be a useful tool in addressing the contribution 
of unresolved transcription-associated R-loops to AGS pathology. 

TREX1, ADAR1 and SAMHD 1 process retroelement-derived 
nucleic acids and help to suppress retroelements expansion in the 
host genome and their recognition by the immune system [64]. 
Interestingly, recent genome-wide studies have demonstrated that 
RNA/DNA hybrids are particularly enriched at retrotransposon 
elements in yeast cells [22], suggesting that expansion of retro- 
elements due to mutations in TREX1, ADAR1 or SAMHD 1 may 
lead to increased RNA/DNA hybrid levels, contributing to 
autoimmunitity in AGS. Indeed, it has recendy been demonstrated 
that RNA/DNA hybrids can be sensed by toll-like receptor 9 
(TLR9) to induce pro-inflammatory cytokine and antiviral 
interferon production in dendritic cells [68] . 

R-loops in Nucleotide Expansion Diseases 

Expansions of repetitive sequences have been linked to over 
forty human diseases [69], and R-loops have been proposed to 
play a role in their pathology [70-73]. Remarkably, R-loops are 
formed following transcription of trinucleotide repeats in vitro, in 
bacteria and human cells [70,71,73]. Interestingly, the nontem- 
plate DNA strand in many repetitive sequences can adopt unusual 
DNA structures, including G-quadruplexes and DNA triplexes, 
which may further stabilise R-loops [74]. Moreover, R-loops 
formed at CTG repeats promote repeat instability characteristic of 
these diseases [71]. 

Recently, we demonstrated that R-loops form over expanded 
GAA and CGG repeats in cells from Friedreich's Ataxia (FRDA) 
and Fragile X syndrome (FXS) patients, respectively (Figure 2C) 
[75]. The abundance of these stable R-loops correlates with 
expansion size, and they colocalise with the repressive chromatin 
marks characteristic of these diseases (Figure 2C). R-loops can also 
trigger the formation of repressive chromatin and cause transcrip- 
tional silencing of the FXN gene, providing a molecular link 
between R-loops and the pathology of expansion diseases [75]. In 
line with R-loops formed on expanded "premutation" and "full 
mutation" GGG-repeat-containing alleles of the FMR1 gene 
[75,76], promoter-bound FMR1 mRNA containing trinucleotide 
repeats was shown to promote epigenetic silencing in FXS [77]. 
Importandy, the involvement of R-loops in expansion diseases is 
not limited to trinucleotide repeats, since R-loops associated with 
expanded hexanucleotide GGGGCG repeats in C9orf72 contrib- 
ute to the molecular event leading to amyotrophic lateral sclerosis 
(ALS) and frontotemporal dementia (FTD) [78]. 

R-loops could contribute to the pathology of expansion diseases 
in various ways. Similar to R-loops at the 3'ends of human genes, 
expansion-associated R-loops may form a structural block, directly 
interfering with Pol II transcriptional elongation [16,24]. Alterna- 
tively, R-loops may nucleate repressive chromatin over the 
expansion region, by analogy with heterochromatin formation at 
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centromeres in Schizosaccharomyces pombe [25], or promote 
chromatin compaction associated with histone H3S10 phosphor- 
ylation, as observed in S. cerevisiae, Caenorhabditis elegans, and 
human cells [26]. Furthermore, R-loops could cause the charac- 
teristic intergenerational and somatic instability of repeat sequenc- 
es [72]. 

R-loops in Cancer 

Genome instability is a hallmark of cancer, and it may actively 
drive hereditary tumour development [79,80]. Research in the last 
decade has clearly demonstrated that dysregulation of R-loops can 
corrupt genome integrity, resulting in increased DNA sensitivity to 
damaging agents, formation of DSBs, chromosome breaks, fragile 
site instability, chromosome loss, and recombination events [5]. 
Several mechanisms have therefore evolved to maintain R-loop 
levels in balance, and alterations in genome caretaker processes can 
affect R-loop levels and genome stability [4] . Moreover, mutations 
in proteins controlling R-loop levels have been identified in tumours 
(Figure 2D). For example, in eosinophilic leukemia, an oncogenic 
translocation renders cleavage and polyadenylation factor FIP1L1 
inactive, which has been previously shown to cause increased R- 
loop levels, DNA damage and chromosome instability (Figure 2D) 
[31]. A similar mechanism was suggested for RNA kinase CLP1, 
which is associated with a translocation in mixed lineage leukemia 
(MLL) [31]. The histone ubiquitin ligase BRE1 also limits R-loop 
levels, and its decreased expression may contribute to the high levels 
of genomic instability observed in testicular seminoma [81]. 

The link between R-loops and cancer has been further 
substantiated by the finding that the tumour suppressor BRCA2, 
which is mutated in breast and ovarian cancer, is required to prevent 
R-loop accumulation and genome instability [82], These observa- 
tions raise the interesting possibility that R-loops may provide 
proliferative advantages to tumour cells by promoting genome 
instability. This will in turn increase the probability of accumulating 
mutations favourable to tumour growth and metastasis. Intriguingly, 
recent evidence demonstrates that human oncogenic viruses may 
also promote genomic instability through accumulation of R-loops 
after infection. Kaposi's sarcoma-associated herpesvirus (KSHV), 
which causes multiple AIDS-related cancers, encodes the ORF57 
protein, which can sequester the host hTREX complex, important 
for mRNA processing and export [83] . Sequestration of hTREX 
leads to KSHV-induced accumulation of R-loops and causes 
damage to the host DNA, contributing to tumourigenesis [83] . 

Whilst some proteins suppress R-loop formation, others may 
promote R-loops and so increase genome instability leading to 
tumour development. This unexpected function has been shown in 
yeast for transcription elongation factor Spt2 and DNA repair 
protein Rad51 [7,84]. Overexpression of Spt2 leads to transcription- 
dependent chromosomal rearrangements, which are prevented by 
RNase H overexpression [84] . Spt2 is structurally related to human 
HMG1, which is overexpressed in gastric cancers and malignant 
melanomas [84] . However, it is not clear if increased HMG 1 levels 
promote R-loops and DNA damage in cancer cells. In contrast to its 
well-established role in DNA strand exchange during homologous 
recombination and DNA repair [85], recent studies have shown that 
Rad5 1 can also mediate R-loop formation and genome instability in 
trans, extending the prevailing view that R-loops form cotranscrip- 
tionally [7] . Similar to HMG 1 , RAD5 1 is overexpressed in human 
cancers [7]. However, it remains to be elucidated if RAD51 
overexpression in cancers is a consequence of activated DNA repair 
pathways, or a cause of genome instability [7] . 

R-loops have been detected in immunoglobulin (Ig) genes, 
where they initiate class switch recombination by exposing single- 



stranded DNA, thus providing the substrate for activation-induced 
cytidine deaminase (AID), which promotes DSBs and subsequent 
translocation between Ig heavy chains [21,86]. Although this 
process is essential for generation of antibody isotype diversity, 
AID-mediated mutagenesis has also been implicated in patholog- 
ical translocations between the Ig loci and other active genes, 
leading to production of fusion proteins or oncogenic gene 
expression, observed in B cell malignancies (Figure 2E) [87]. 
Interestingly, R-loops are also found in common translocation 
partners of Ig genes, including the oncogene c-MYC [18,27]. 
Therefore, the simultaneous formation of R-loops in Ig and 
transcribed non-Ig genes may induce AID-mediated DSB 
formation, leading to pathological translocations (Figure 2E) 
[27,88,89]. Interestingly, overexpression of the APOBEC family 
of AID-related enzymes in breast cancer have been linked to 
genomic mutations, pointing to a potentially broader role of R- 
loops and AID/APOBEC-mediated genome instability in cancer 
[90]. 

Changes in gene expression are another central aspect of cancer 
[79]. In healthy cells, the expression of tumour suppressor genes 
prevents abnormal proliferation and other aspects of tumourigen- 
esis [79]. Tumour suppressors are frequently silenced in cancer by 
excessive promoter DNA methylation [91]. It has been proposed 
that R-loop formation at promoters protects against DNA 
methylation by de novo DNA methyltransfera.se DNMT3B, 
thereby keeping genes active [18]. Since R-loops have been 
computationally predicted to form at promoters of tumour 
suppressor genes BRCA1, RASSF1A, and CDKN2A [92], it is 
important to investigate if R-loop levels at these genes are reduced 
in cancer and how this relates to the observed DNA hypermethy- 
lation. 

In contrast to this, efficient transcription of the oncogene c-MYC 
requires that R-loop levels are kept low by the activity of DNA 
topoisomerase IIIB, which is recruited to arginine-methylated 
histones by the tudor domain containing 3 (TDRD3) protein [27]. 
This R-loop-mediated mechanism of c-MYC gene regulation may 
be relevant to tumour progression in breast cancer, which 
frequently shows overexpression of both c-MYC and TDRD3 
[27,93]. Therefore, it is tempting to speculate that increased 
TDRD3 levels suppress R-loops in c-MYC, thereby allowing its 
enhanced expression, which correlates with poor cancer prognosis 
[93]. However, it still remains to be determined if R-loops play a 
specific role in transcription dysregulation in cancer and if this 
process differs from R-loop-mediated transcriptional programmes 
associated with housekeeping genes. 

More recently, R-loops have been implicated in cell senescence, a 
mechanism protecting against tumour cell proliferation [79]. In 
particular, the telomeric noncoding (nc) RNA TERRA forms R- 
loops which are induced when R-loop suppressors such as RNase H 
or Thp2 are lost [94,95]. In the absence of telomerase, telomeric R- 
loops promote recombination-mediated telomere elongation via 
Rad52, and this delays the onset of cellular senescence [94]. In 
contrast, in Rad5 2 -deficient cells, R-loop accumulation leads to 
telomere shortening and premature senescence [94] . Interestingly, 
cells from AOA2 patients with senataxin mutations contain shorter 
telomeres, suggesting a possible involvement of SETX in telomere 
stability [96]. Telomeric R-loops therefore play a complex and 
dynamic role in telomere length maintenance and cellular 
proliferative potential (Figure 2F). 

In conclusion, multiple lines of evidence point to an involve- 
ment of R-loops in cancer biology. Yet it still remains to be 
investigated if R-loop levels are indeed regulated differentially in 
normal and tumour tissues and if they can directly influence 
tumourigenesis. 
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R-loop Therapies 

R-loops represent a potential therapeutic target. Despite their 
importance in gene regulation, they have yet to be fully exploited 
in drug design [97]. Various ligands can target RNA/DNA 
hybrids, including ethidium bromide, the aminoglycosides neo- 
mycin and paramomycin, and the polyamides distamycin and 
netropsin [98]. These compounds recognise RNA/DNA hybrids 
through intercalation and binding to the nucleic acid groove. 
Although exhibiting high binding affinities to RNA/DNA hybrids, 
many of these molecules also bind dsDNA and RNA and are 
mutagenic, limiting their potential biological applications [98]. 
However, recent studies suggest that combining the properties of 
these ligands can achieve subnanomolar affinity for RNA/DNA 
hybrids. In particular, this has been demonstrated for ligands 
linking aminoglycosides to derivatives of ethidium bromide [99], 
providing a possible approach for the development of potent and 
specific RNA/DNA hybrid ligands in future drug design efforts. 

Various compounds that modulate DNA supercoiling and inhibit 
DNA topoisomerases, including topotecan and camptothecin, can 
also affect R-loop formation in vivo [29,50]. In particular, 
topoisomerase inhibitors have recently been used to reactivate the 
silenced paternal Ube3a gene, which encodes a ubiquitin E3 ligase, 
to compensate for the deleted maternal Ube3a in Angelman 
syndrome (AS). AS and Prader-Willi syndrome (PWS) are imprinted 
neurodevelopmental disorders that are often caused by large 
deletions of human chromosome 15qll-ql3 over the Snordll6 
gene locus, but the deletion differs in its parent-of-origin [100]. In 
neurons, only the maternal Ube3a allele is expressed, because the 
paternal Ube3a allele is silenced by expression of the ncRNA 
Ube3a-ATS (Figure 3A) [101]. AS therapies therefore seek to 
reactivate the silenced, but genetically intact, paternal Ube3a allele. 
Interestingly, R-loops were recendy shown to regulate the neuronal 
expression of the paternal Ube3a-ATS transcript, which is essential 
for transcriptional silencing of the paternal Ube3a gene [50] . In 
particular, treatment with the topoisomerase inhibitor topotecan 
increased R-loop levels over the Snordll6 locus, resulting in 
chromatin decondensation, inhibition of Pol II transcription of 
Ube3a-ATS, and concomitant increase in Ube3a expression from 
the paternal allele (Figure 3B). This R-loop-mediated reactivation 
of paternal Ube3a could therefore compensate for the loss of 
maternal Ube3a in AS and so potentially holds promise for targeted 
therapies for both AS and PWS (Figure 3B). 

It has previously been proposed that R-loops in trinucleotide 
expansion diseases could be targeted to suppress repeat expansions 
or reactivate silenced genes [72]. A recent study provided direct 
evidence that a small molecule is able to suppress R-loop formation 
at expanded CGG repeats in the FMR1 gene, thereby preventing 
FMR1 epigenetic silencing in FXS [77]. As an alternative 
approach, R-loop levels may be indirectiy modulated by treatments 
that target proteins involved in R-loop biology (Table SI). For 
instance, genomic instability caused by a widespread increase of R- 
loops due to loss of an R-loop suppressing protein could potentially 
be reverted by introduction of an alternative R-loop suppressor. 

Recent identification of small-molecule inhibitors for RNase H2 
may also provide a powerful new tool for the study of R-loop biology 
in health and disease [102]. Furthermore, the S9.6 antibody offers 
new opportunities for research and development. In particular, it 
has already been used in the development of biosensor systems 
[1 03] , detection of miRNA targets [1 04] , and as a key component of 
human papillomavirus (HPV) diagnostic kit (Qiagen). 

The explosion of studies uncovering the role of R-loops in 
health and disease in recent years provides the exciting prospect of 
developing new targeted therapeutics for many human disorders. 



However, due to the ubiquitous nature of R-loops it will be 
important to ensure that efficient treatments are specific. 

Conclusions and Future Challenges 

R-loops have been implicated in many biological processes in 
different organisms. R-loops can play positive and negative roles in 
gene expression; they can mediate Ig class switch recombination 
and transcriptional termination, affect genome stability, transcrip- 
tion, cell cycle progression, and cell viability. Despite the diversity 
of these biological processes, the molecular mechanisms associated 
with R-loop formation in mammalian cells remain largely 
unknown. It is unclear how R-loops can regulate gene expression, 
how they are maintained and eliminated in the cells, and which 
proteins are involved in the regulation of these processes. 

The connections between R-loops and human diseases suggest 
that cells have evolved mechanisms to distinguish between 
deleterious and beneficial R-loops. However, the evidence discussed 
above raises an important question: how can R-loop dysregulation 
be mechanistically linked to a variety of human diseases with such 
diverse pathologies? One explanation may be that R-loops form in 
many genomic locations in healthy cells [16,18,22,27]. Therefore, 
unsurprisingly, their dysregulation can affect a large number of 
disease-associated genes. This is in contrast to gene-specific R-loop 
pathologies, associated with mutations, which result in altered R- 
loop levels locally, as observed in the repeat expansion diseases 
FRDA and FXS [75,77]. Furthermore, R-loops can have different 
intrinsic properties. R-loops at expanded GAA repeats in the FXN 
gene are highly stable and trigger transcriptional repression, while R- 
loops in the highly-expressed y-Actin gene are easily turned over 
[75] . This could, in part, be due to differential activity of R-loop 
processing proteins on different classes of genes, as proposed in yeast 
[22]. Adding another layer of complexity, the formation of R-loops 
can be influenced by cell type [77], cell cycle stage [15], gene length, 
and/or GC content and transcriptional level [22,105]. Epigenetic 
marks including DNA methylation and post-translational histone 
modifications can contribute to further modulation of R-loop levels 
[18,27]. Thus, R-loops represent cellular structures that share the 
same elementary composition, but may possess different dynamic 
properties, which can be affected by any of the aforementioned 
processes, thus explaining the wide range of diseases associated with 
R-loops. 

Despite the lack of mechanistic insights into R-loop-associated 
diseases, some common themes, underlying their pathology, are 
already becoming obvious. First, there is a strong connection 
between R-loop dysregulation and induction of DNA damage and 
loss of genome integrity, which contributes to cancer development 
[31,81,88], repeat expansion diseases [71], and neurodegeneration 
[29,45]. Secondly, R-loops can mediate changes in transcription 
locally or globally, contributing to pathologies associated with 
repeat expansion diseases [75,77,78], Angelman syndrome [50], 
and cancer [27]. However, it is a strong possibility that both of 
these pathological themes may overlap in many disorders, as 
observed in repeat expansion diseases [71,75,77], and novel 
disease themes may be revealed in the future. 

One of the major challenges in R-loop field is to investigate the 
causes and consequences of R-loop formation in additional models 
of human disease. Uncovering further aspects of R-loop biology in 
human cells will certainly shed light on many basic biological 
questions and have major implications for our understanding of 
human disease. Future studies will undoubtedly reveal more 
diseases associated with R-loop dysregulation and will provide the 
basis for novel therapeutic approaches targeting these so far 
overlooked structures in gene expression. 
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Figure 3. Potential R-loop-based therapeutic approach in Angelman Syndrome (AS). A. Neuronal expression of the paternal ncRNA 
Ube3a-ATS represses paternal Ube3a gene in cis [101]. DNA methylation of the Snordl16 locus on the maternal allele prevents Ube3a-ATS 
transcription, resulting in Ube3a expression from the maternal allele. Transcriptional repression is indicated by red crosses. B. R-loop-mediated re- 
activation of silent paternal Ube3a gene provides a targeted therapy for AS. Deletion leading to the loss of maternal Ube3a expression detected in AS 
is indicated by the red dashed line. Topotecan treatment increases R-loop levels over the Snord116 locus, resulting in chromatin decondensation, 
inhibition of Pol II transcription through Ube3a-ATS, and increased expression of Ube3a from the paternal allele [50]. 
doi:1 0.1 371 /journal.pgen.1 004630.g003 



Supporting Information 

Table SI Proteins implicated in R-loop biology. For 

multiprotein complexes, only subunits directly implicated in R- 
loop biology are mentioned in the table. *Asterisk indicates that 
protein association with R-loops is based on in vitro evidence. 
(DOCX) 



Text SI Supplemental references. 

(DOCX) 

Acknowledgments 

We thank Prof. P. Cook, Prof. N.J. Proudfoot, Prof. C. Norbury and L. M. 
Silva for critically reading the manuscript. 



References 

1. Thomas M, White RL, Davis RW (1976) Hybridization of RNA to double- 
stranded DNA: formation of R-loops. Proc Natl Acad Sci U S A 73: 2294- 
2298. 

2. White RL, Hogness DS (1977) R loop mapping of the 18S and 28S sequences 
in the long and short repeating units of Drosophila melanogaster rDNA. Cell 
10: 177-192. 

3. Drolet M, Bi X, Liu LF (1994) Hypernegative supercoiling of the DNA 
template during transcription elongation in vitro. J Biol Chem 269: 2068—2074. 

4. Aguilera A, Garcia-Musc T (2012) R loops: from transcription byproducts to 
threats to genome stability. Mol Cell 46: 1 15-124. 

5. Hampcrl S, Cimprich KA (2014) The contribution of co-transcriptional 
RNA:DNA hybrid structures to DNA damage and genome instability. DNA 
Repair (Amst) 19: 84-94. 



Skourti-Stathaki K, Proudfoot NJ (2014) A double-edged sword: R loops as 
threats to genome integrity and powerful regulators of gene expression. Genes 
Dev 28: 1384-1396. 

Wahba L, Gore SK, Koshland D (2013) The homologous recombination 
machinery modulates the formation of PvNA-DNA hybrids and associated 
chromosome instability. Elifc 2: e00505. 

Boguslawski SJ, Smith DE, Michalak MA, Mickelson KE, Yehle CO, ct al. 
(1986) Characterization of monoclonal antibody to DNA.RJNA and its 
application to immunodetection of hybrids. J Immunol Methods 89: 123— 
130. 

Phillips DD, Garboczi DN, Singh K, Hu Z, Lcppla SH, ct al. (2013) The sub- 
nanomolar binding of DNA-RNA hybrids by the single-chain Fv fragment of 
antibody S9.6. J Mol Recognit 26: 376-381. 



PLOS Genetics | www.plosgenetics.org 



7 



September 2014 | Volume 10 | Issue 9 | e1004630 



10. Hu Z, Zhang A, Storz G, Gottesman S, Lcppla SH (2006) An antibody-based 
microarray assay for small RNA detection. Nucleic Acids Res 34: c52. 

11. El Hage A, French SL, Beyer AL, Tollcrvcy D (2010) Loss of Topoisomerase I 
leads to R-loop-mediated transcriptional blocks during ribosomal RNA 
synthesis. Genes Dev 24: 1546-1558. 

12. Mischo HE, Gomez-Gonzalez B, Grzechnik P, Rondon AG, Wei W, ct al. 
(2011) Yeast Senl hclicase protects the genome from transcription-associated 
instability. Mol Cell 41: 21-32. 

13. Sun Q,, Csorba T, Skourti-Stathaki K, Proudfoot NJ, Dean C (2013) R-loop 
stabilization represses antisense transcription at the Arabidopsis FLC locus. 
Science 340: 619-621. 

14. Bcchcrel OJ, Yco AJ, Stellati A, Hcng EY, Luff J, ct al. (2013) Senataxin plays 
an essential role with DNA damage response proteins in meiotic recombination 
and gene silencing. PLoS Genet 9: el003435. 

15. Yco AJ, Bcchcrel OJ, Luff JE, Cullcn JK, Wongsurawat T, ct al. (2014) R- 
Loops in Proliferating Cells but Not in the Brain: Implications for AOA2 and 
Other Autosomal Recessive Ataxias. PLoS ONE 9: e90219. 

16. Skourti-Stathaki K, Proudfoot NJ, Gromak N (201 1) Human senataxin resolves 
RNA/DNA hybrids formed at transcriptional pause sites to promote Xrn2- 
dependent termination. Mol Cell 42: 794-805. 

17. Ginno PA, Lim YW, Lott PL, Korf I, Chcdin F (2013) GC skew at the 5' and 
3' ends of human genes links R-loop formation to epigenetie regulation and 
transcription termination. Genome Res 23: 1590-1600. 

18. Ginno PA, Lott PL, Chnstcnscn HC, Korf I, Chedin F (2012) R-loop 
formation is a distinctive characteristic of unmethylatcd human CpG island 
promoters. Mol Cell 45: 814-825. 

19. Xu B, Clayton DA (1996) RNA-DNA hybrid formation at the human 
mitochondrial heavy-strand origin ceases at replication start sites: an 
implication for RNA-DNA hybrids serving as primers. EMBOJ 15: 3135— 
3143. 

20. Itoh T, Tomizawa J (1980) Formation of an RNA primer for initiation of 
replication of ColEl DNA by ribonuclcasc H. Proc Natl Acad Sci U S A 77: 
2450-2454. 

21. Yu K, Chcdin F, Hsich CL, Wilson TE, Licber MR (2003) R-loops at 
immunoglobulin class switch regions in the chromosomes of stimulated B cells. 
Nat Immunol 4: 442-45 1 . 

22. Chan YA, Aristizabal MJ, Lu PY, Luo Z, Hamza A, ct al. (2014) Genome- 
Wide Profiling of Yeast DNA:RNA Hybrid Prone Sites with DRIP-Chip. PLoS 
Genet 10: el004288. 

23. Tous C, Aguilera A (2007) Impairment of transcription elongation by R-loops 
in vitro. Biochcm Biophys Res Commun 360: 428-432. 

24. Huertas P, Aguilera A (2003) Cotranscriptionally formed DNA:RNA hybrids 
mediate transcription elongation impairment and transcription-associated 
recombination. Mol Cell 12: 711-721. 

25. Nakama M, Kawakami K, Kajitani T, Urano T, Murakami Y (2012) DNA- 
RNA hybrid formation mediates RNAi-directed hcterochromatin formation. 
Genes Cells 17: 218-233. 

26. Castellano-Pozo M, Santos-Pereira JM, Rondon AG, Barroso S, Andujar E, 
et al. (2013) R loops are linked to histone H3 S10 phosphorylation and 
chromatin condensation. Mol Cell 52: 583—590. 

27. Yang Y, McBridc KM, Hcnslcy S, Lu Y, Chcdin F, ct al. (2014) Arginine 
mcthylation facilitates the recruitment of TOP3B to chromatin to prevent R 
loop accumulation. Mol Cell 53: 484-497. 

28. Santos-Pereira JM, Herrcro AB, Garcia-Rubio ML, Marin A, Moreno S, et al. 

(2013) The Npl3 hnRNP prevents R-loop-mediated transcription-replication 
conllicts and genome instability. Genes Dev 27: 2445-2458. 

29. Sordct O, Redon CE, Guirouilh-Barbat J, Smith S, Seller S, et al. (2009) 
Ataxia telangiectasia mutated activation by transcription- and topoisomerase I- 
induced DNA double-strand breaks. EMBO Rep 10: 887-893. 

.30. Li X, ManleyJL (2005) Inactivation of the SR protein splicing factor ASF/SF2 
results in genomic instability. Cell 122: 365-378. 

31. Stirling PC, Chan YA, Minaker SW, Aristizabal MJ, Barrett I, ct al. (2012) R- 
loop-mcdiated genome instability in mRNA cleavage and polyadenylation 
mutants. Genes Dev 26: 163-175. 

32. Helmrich A, Ballarino M, Tora L (2011) Collisions between replication and 
transcription complexes cause common fragile site instability at the longest 
human genes. Mol Cell 44: 966-977. 

33. I'uduri S, Crabbc L, Conti C, Tourriere H, Holtgrcvc-Grcz H, ct al. (2009) 
Topoisomerase I suppresses genomic instability by preventing interference 
between replication and transcription. Nat Cell Biol 11: 1315—1324. 

34. Wahba L, AmonJD, Koshland D, Vuica-Ross M (201 1) RNase H and multiple 
RNA biogenesis factors cooperate to prevent RNA:DNA hybrids from 
generating genome instability. Mol Cell 44: 978—988. 

35. Herrcra-Moyano E, Mergui X, Garcia-Rubio ML, Barroso S, Aguilera A 

(2014) The yeast and human FACT chromatin-reorganizing complexes solve 
R-loop-mediated transcription-replication conllicts. Genes Dev 28: 735-748. 

36. Cerritelli SM, Crouch RJ (2009) Ribonuclcasc H: the enzymes in eukaryotes. 
FEBSJ 276: 1494-1505. 

37. Boulc JB, Zakian VA (2007) The yeast Piflp DNA hclicase preferentially 
unwinds RNA DNA substrates. Nucleic Acids Res 35: 5809-5818. 

38. ShinJH, Kelman Z (2006) The replicativc helicases of bacteria, archaea, and 
cukarya can unwind RNA-DNA hybrid substrates. J Biol Chcm 281: 26914- 
26921. 



39. Morcira MC, Klur S, Watanabe M, Nemeth AH, Le Ber I, et al. (2004) 
Senataxin, the ortholog of a yeast RNA hclicase, is mutant in ataxia-ocular 
apraxia 2. Nat Genet 36: 225-227. 

40. Anheim M, Monga B, Flcury M, Charles P, Barbot C, ct al. (2009) Ataxia with 
oculomotor apraxia type 2: clinical, biological and genotype/phenotype 
correlation study of a cohort of 90 patients. Brain 132: 2688-2698. 

41. Chen YZ, Bennett CL, Huynh HM, Blair IP, Puis I, et al. (2004) DNA/RNA 
hclicase gene mutations in a form of juvenile amyotrophic lateral sclerosis 
(ALS4). Am J Hum Genet 74: 1 128-1 135. 

42. Surawccra A, Lim Y, Woods R, Birrell GW, Nasim T, ct al. (2009) Functional 
role for senataxin, defective in ataxia oculomotor apraxia type 2, in 
transcriptional regulation. Hum Mol Genet 18: 3384-3396. 

43. Steinmetz EJ, Warren CL, Kuehner JN, Panbehi B, Ansari AZ, ct al. (2006) 
Gcnomc-widc distribution of yeast RNA polymerase II and its control by Senl 
hclicase. Mol Cell 24: 735-746. 

44. West S, Gromak N, Proudfoot NJ (2004) Human 5'^3' cxonuclcasc Xrn2 
promotes transcription termination at co-transcriptional cleavage sites. Nature 
432: 522-525. 

45. Surawccra A, Bcchcrel OJ, Chen P, Rundle N, Woods R, et al. (2007) 
Senataxin, defective in ataxia oculomotor apraxia type 2, is involved in the 
defense against oxidative DNA damage. J Cell Biol 177: 969-979. 

46. Yucc O, West SC (2013) Senataxin, defective in the neurodegenerative 
disorder ataxia with oculomotor apraxia 2, lies at the interface of transcription 
and the DNA damage response. Mol Cell Biol 33: 406-417. 

47. Alzu A, Bermcjo R, Begnis M, Lucca C, Piccini D, et al. (2012) Senataxin 
associates with replication forks to protect fork integrity across RNA- 
polymcrase-II-transcribed genes. Cell 151: 835-846. 

48. Richard P, Feng S, ManleyJL (20 1 3) A SUMO-dcpendent interaction between 
Senataxin and the exosome, disrupted in the neurodegenerative disease AOA2, 
targets the exosome to sites of transcription-induced DNA damage. Genes Dev 
27: 2227-2232. 

49. Vantaggiato C, Bondioni S, Airoldi G, Bozzato A, Borsani G, ct al. (2011) 
Senataxin modulates neurite growth through fibroblast growth factor 8 
signalling. Brain 134: 1808-1828. 

50. Powell WT, Coulson RL, Gonzales ML, Crary FK, Wong SS, ct al. (2013) R- 
loop formation at Snordll6 mediates topoteean inhibition of Ubc3a-antiscnsc 
and allcle-spccific chromatin decondensation. Proc Natl Acad Sci U S A 110: 
13938-13943. 

51. Helmrich A, Ballarino M, Nudler E, Tora L (2013) Transcription-replication 
encounters, consequences and genomic instability. Nat Struct Mol Biol 20: 
412-418. 

52. Chon H, Sparks JL, Rychlik M, Nowotny M, Burgers PM, ct al. (2013) RNase 
H2 roles in genome integrity revealed by unlinking its activities. Nucleic Acids 
Res 41: 3130-3143. 

53. Cerritelli SM, Frolova EG, Feng C, Grinberg A, Love PE, et al. (2003) Failure 
to produce mitochondrial DNA results in embryonic lethality in Rnasehl null 
mice. Mol Cell 11: 807-815. 

54. Rydberg B, Game J (2002) Excision of misincorporated ribonucleotides in 
DNA by RNase H (type 2) and FEN-1 in cell-free extracts. Proc Natl Acad 
Sci USA 99: 16654-16659. 

55. Nick McElhinny SA, Kumar D, Clark AB, Watt DL, Watts BE, ct al. (2010) 
Genome instability due to ribonucleotide incorporation into DNA. Nat Chcm 
Biol 6: 774-781. 

56. Rcijns MA, Rabe B, Rigby RE, Mill P, Astell KR, ct al. (2012) Enzymatic 
removal of ribonucleotides from DNA is essential for mammalian genome 
integrity and development. Cell 149: 1008-1022. 

57. Lazzaro F, Novarina D, Amara F, Watt DL, Stone JE, et al. (2012) RNase H 
and postreplication repair protect cells from ribonucleotides incorporated in 
DNA. Mol Cell 45: 99-110. 

58. Arana ME, Kerns RT, Wharey L, Gcrrish KE, Bushel PR, ct al. (2012) 
Transcriptional responses to loss of RNase H2 in Saccharomyces cerevisiae. 
DNA Repair (Amst) 11: 933-941. 

59. Crow YJ, Leitch A, Hayward BE, Garner A, Parmar R, et al. (2006) Mutations 
in genes encoding ribonuclcasc H2 subunits cause Aicardi-Goutieres syndrome 
and mimic congenital viral brain infection. Nat Genet 38: 910-916. 

60. Crow YJ, Rchwinkel J (2009) Aicardi-Goutieres syndrome and related 
phenotypes: linking nucleic acid metabolism with autoimmunity. Hum Mol 
Genet 18: R130-136. 

61. Crow YJ, Hayward BE, Parmar R, Robins P, Leitch A, et al. (2006) Mutations 
in the gene encoding the 3'-5' DNA cxonuclcasc TREX1 cause Aicardi- 
Goutieres syndrome at the AGS1 locus. Nat Genet 38: 917-920. 

62. Rice GI, Kasher PR, Forte GM, Mannion NM, Greenwood SM, ct al. (2012) 
Mutations in ADAR1 cause Aicardi-Goutieres syndrome associated with a type 
I interferon signature. Nat Genet 44: 1243-1248. 

63. Rice GI, Bond J, Asipu A, Brunette RL, Manficld IW, ct al. (2009) Mutations 
involved in Aicardi-Goutieres syndrome implicate SAMHD1 as regulator of 
the innate immune response. Nat Genet 41: 829-832. 

64. Lce-Kirsch MA, Wolf C, Gunthcr C (2014) Aicardi-Goutieres syndrome: a 
model disease for systemic autoimmunity. Clin Exp Immunol 175: 17-24. 

65. Rabe B (2013) Aicardi-Goutieres syndrome: clues from the RNase H2 knock- 
out mouse. J Mol Med (Bcrl) 91: 1235-1240. 

66. Hillcr B, Achlcitncr M, Glagc S, Naumann R, Bchrcndt R, et al. (2012) 
Mammalian RNase H2 removes ribonucleotides from DNA to maintain 
genome integrity. J Exp Med 209: 1419-1426. 



PLOS Genetics | www.plosgenetics.org 



8 



September 2014 | Volume 10 | Issue 9 | e1004630 



67. Bubeck D, Reijns MA, Graham SC, Astell KRJones EY, et al. (201 1) PCNA 
directs type 2 RNasc H activity on DNA replication and repair substrates. 
Nucleic Acids Res 39: 3652-3666. 

68. Rigby RE, Webb LM, Mackenzie KJ, Li Y, Leitch A, ct al. (2014) RNA:DNA 
hybrids are a novel molecular pattern sensed by TLR9. EMBO J 33: 542-558. 

69. Lopez Castel A, ClearyJD, Pearson CE (2010) Repeat instability as the basis 
for human diseases and as a potential target for therapy. Nat Rev Mol Cell Biol 
11: 165-170. 

70. Grabczyk E, Mancuso M, Sammarco MC (2007) A persistent RNA.DNA 
hybrid formed by transcription of the Friedreich ataxia triplet repeat in live 
bacteria, and by T7 RNAP in vitro. Nucleic Acids Res 35: 5351-5359. 

71. Lin Y, Dent SY, Wilson JH, Wells RD, Napierala M (2010) R loops stimulate 
genetic instability of CTG.CAG repeats. Proc Nad Acad Sci U S A 107: 692- 
697. 

72. Mclvor EI, Polak U, Napierala M (2010) New insights into repeat instability: 
role of RNA*DNA hybrids. RNA Biol 7: 551-558. 

73. Rcddy K, Tarn M, Bowatcr RP, Barber M, Tomlinson M, ct al. (2011) 
Determinants of R-loop formation at convergent bidirectionally transcribed 
trinucleotide repeats. Nucleic Acids Res 39: 1749-1762. 

74. Bclotserkovskii BP, Mirkin SM, Hanawalt PC (2013) DNA sequences that 
interfere with transcription: implications for genome function and stability. 
Chcm Rev 1 13: 8620-8637. 

75. Groh M, Lufino MM, Wadc-Martins R, Gromak N (2014) R-loops associated 
with triplet repeat expansions promote gene silencing in Friedreich ataxia and 
fragile X syndrome. PLoS Genet 10: cl004318. 

76. Loomis EW, Sanz LA, Chedin F, Hagerman PJ (2014) Transcription- 
Associated R-Loop Formation across the Human FMR1 CGG-Rcpcat Region. 
PLoS Genet 10: el 004294. 

77. Colak D, Zaninovic N, Cohen MS, Rosenwaks Z, Yang WY, ct al. (2014) 
Promoter-bound trinucleotide repeat mRNA drives epigenetie silencing in 
fragile X syndrome. Science 343: 1002—1005. 

78. Hacuslcr AR, Donnelly CJ, Pcriz G, Simko EA, Shaw PG, ct al. (2014) 
C9orl72 nucleotide repeat structures initiate molecular cascades of disease. 
Nature 507: 195-200. 

79. Hanahan D, Weinberg RA (201 1) Hallmarks of cancer: the next generation. 
Cell 144: 646-674. 

80. Negrini S, Gorgoulis VG, Halazonetis TD (2010) Genomic instability-an 
evolving hallmark of cancer. Nat Rev Mol Cell Biol 1 1 : 220-228. 

81. Chcrnikova SB, Razorenova OV, HigginsJP, Sishc BJ, Nicolau M, ct al. (2012) 
Deficiency in mammalian histone H2B ubiquitin ligase Brel (Rnf20/Rnf 10; 
leads to replication stress and chromosomal instability. Cancer Res 72: 2111- 
2119. 

82. Bhatia V, Barroso SI, Garcia-Rubio ML, Tumini E, Herrera-Moyano E, et al. 
(2014) BRCA2 prevents R-loop accumulation and associates with TREX-2 
mRNA export factor PCID2. Nature 511: 362-365. 

83. Jackson BR, Nocrenberg M, Whitehouse A (2014) A novel mechanism 
inducing genome instability in Kaposi's sarcoma-associated herpesvirus 
infected cells. PLoS Pathog 10: cl004098. 

84. Sikdar N, Bancrjcc S, Zhang H, Smith S, Myung K (2008) Spt2p defines a new 
transcription-dependent gross chromosomal rearrangement pathway. PLoS 
Genet 4: el 000290. 

85. San Filippo J, Sung P, Klein H (2008) Mechanism of eukaryotic homologous 
recombination. Annu Rev Biochem 77: 229-257. 



86. Chaudhuri J, Tian M, Khuong C, Chua K, Pinaud E, et al. (2003) 
Transcription-targeted DNA deamination by the AID antibody diversification 
enzyme. Nature 422: 726-730. 

87. Robbiani DF, Nussenzweig MC (2013) Chromosome translocation, B cell 
lymphoma, and activation-induced cytidine deaminase. Annu Rev Pathol 8: 
79-103. 

88. Ruiz JF, Gomez-Gonzalez B, Aguilcra A (2011) AID induces double-strand 
breaks at immunoglobulin switch regions and e-MYC causing chromosomal 
translocations in yeast THO mutants. PLoS Genet 7: el002009. 

89. Duquette ML, Pham P, Goodman MF, Maizels N (2005) AID binds to 
transcription-induced structures in c-MYC that map to regions associated with 
translocation and hypermutation. Oncogene 24: 5791-5798. 

90. Burns MB, Lackey L, Carpenter MA, Rathore A, Land AM, ct al. (201.3) 
APOBEC3B is an enzymatic source of mutation in breast cancer. Nature 494: 
366-370. 

91. KulisM, EstcllcrM (2010) DNA methylation and cancer. Adv Genet 70: 27-56. 

92. Wongsurawat T, Jenjarocnpun P, Kwoh CK, Kuznetsov V (201 1) Quantitative 
model of R-loop forming structures reveals a novel level of RNA-DNA 
interactome complexity. Nucleic Acids Res 40: el6. 

93. Hyncs NE, Stoelzle T (2009) Key signalling nodes in mammary gland 
development and cancer: Myc. Breast Cancer Res 11: 210. 

94. Balk B, Maichcr A, Dees M, Klermund J, Lukc-Glascr S, et al. (2013) 
Telomeric RNA-DNA hybrids affect telomcre-lcngth dynamics and senes- 
cence. Nat Struct Mol Biol 20: 1199-1205. 

95. Pfciffcr V, Crittin J, Grolimund L, Lingncr J (2013) The THO complex 
component Thp2 counteracts telomeric R-loops and telomere shortening. 
EMBOJ 32: 2861-2871. 

96. Dc Amicis A, Pianc M, Ferrari F, Fanciulli M, Delia D, ct al. (201 1) Role of 
senataxin in DNA damage and telomeric stability. DNA Repair (Amst) 10: 
199-209. 

97. Wheelhouse RT, Chaires JB (2010) Drug binding to DNA X RNA hybrid 
structures. Methods Mol Biol 613: 55-70. 

98. Shaw NN, Arya DP (2008) Recognition of the unique structure of DNA:RNA 
hybrids. Biochimic 90: 1026-1039. 

99. Shaw NN, Xi H, Arya DP (2008) Molecular recognition of a DNA:RNA 
hybrid: sub-nanomolar binding by a neomyein-methidium conjugate. Bioorg 
Med Chem Lett 18: 4142-4145. ' 

100. Cassidy SB, Dykcns E, Williams CA (2000) Prader-Willi and Angelman 
syndromes: sister imprinted disorders. Am J Med Genet 97: 136-146. 

101. Mcng L, Person RE, Bcaudct AL (2012) Ubc3a-ATS is an atypical RNA 
polymerase II transcript that represses the paternal expression of Ubc3a. Hum 
Mol Genet 21: 3001-3012. 

102. White R, Saxty B, Large J, Kctdcborough CA, Jackson AP (2013) Identification 
of small-molecule inhibitors of the ribonuclease H2 enzyme. J Biomol Screen 
18: 610-620. 

103. Sipova H, Zhang S, Dudley AM, Galas D, Wang K, et al. (2010) Surface 
plasmon resonance biosensor for rapid label-free detection of microribonucleic 
acid at subfemtomole level. Anal Chem 82: 101 10-10115. 

104. Qavi AJ, Kindt JT, Gleeson MA, Bailey RC (201 1) Anti-DNA:RNA antibodies 
and silicon photonic microring resonators: increased sensitivity for multiplexed 
microRNA detection. Anal Chcm 8.3: 5949-5956. 

105. Gomez-Gonzalez B, Garcia-Rubio M, Bermejo R, Gaillard H, Shirahige K, 
et al. (2011) Genome-wide function of THO/TREX in active genes prevents 
R-loop-dependent replication obstacles. EMBOJ 30: 3106-3119. 



PLOS Genetics | www.plosgenetics.org 



9 



September 2014 | Volume 10 | Issue 9 | e1004630 



